Sample size determination in microarray experiments for class comparison and prognostic classification.

نویسندگان

  • Kevin Dobbin
  • Richard Simon
چکیده

Determining sample sizes for microarray experiments is important but the complexity of these experiments, and the large amounts of data they produce, can make the sample size issue seem daunting, and tempt researchers to use rules of thumb in place of formal calculations based on the goals of the experiment. Here we present formulae for determining sample sizes to achieve a variety of experimental goals, including class comparison and the development of prognostic markers. Results are derived which describe the impact of pooling, technical replicates and dye-swap arrays on sample size requirements. These results are shown to depend on the relative sizes of different sources of variability. A variety of common types of experimental situations and designs used with single-label and dual-label microarrays are considered. We discuss procedures for controlling the false discovery rate. Our calculations are based on relatively simple yet realistic statistical models for the data, and provide straightforward sample size calculation formulae.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Biclustering Based Classification Framework for Cancer Diagnosis and Prognosis

In gene expression microarray data analysis, biclustering has been demonstrated to be one of the most effective methods for discovering gene expression patterns under various conditions. We present in this study a framework to take advantage of the homogeneously expressed genes in biclusters to construct a classifier for sample class membership prediction. Extensive experiments on 8 real cancer...

متن کامل

عوامل پیش‌بینی کننده خون‌ریزی مجدد در خون‌ریزی از واریس مری در بیماران بستری در بخش گوارش بیمارستان امام خمینی 87-86

Background and Objective: Esophageal variceal bleeding is associated with a high mortality rate and expensive hospitalization costs. By diagnosing predicting factors of rebleeding at admission, and proper course of action, we can minimize the rates of mortality rebleeding. The aim of this study was to determine the predicting factors of rebleeding in patients hospitalized because of variceal he...

متن کامل

SFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy

 In this paper, we propose a new gene selection algorithm based on Shuffled Frog Leaping Algorithm that is called SFLA-FS. The proposed algorithm is used for improving cancer classification accuracy. Most of the biological datasets such as cancer datasets have a large number of genes and few samples. However, most of these genes are not usable in some tasks for example in cancer classification....

متن کامل

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...

متن کامل

جاسازی خط ویژگی وزن‌دار برای استخراج ویژگی تصاویر ابرطیفی

One of the most preprocessing steps before the classification of hyperspectral images is supervised feature extraction. Because obtaining the training samples is hard and time consuming, the number of available training samples is limited. We propose a supervised feature extraction method in this paper that is efficient in small sample size situation. The proposed method, which is called weight...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Biostatistics

دوره 6 1  شماره 

صفحات  -

تاریخ انتشار 2005